Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 29240 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.9 MiB |
| Average record size in memory | 104.0 B |
Variable types
| Numeric | 8 |
|---|---|
| Text | 2 |
| Categorical | 3 |
F_AMOUNT_TRANSACTION is highly overall correlated with N_CURRENCY_CODE_TRANSACTION | High correlation |
ID_TRX is highly overall correlated with N_TRANSMISSION_DATE_AND_TIME | High correlation |
N_ACQ_INSTITUTION_COUNTRY_CODE is highly overall correlated with N_CURRENCY_CODE_TRANSACTION | High correlation |
N_CURRENCY_CODE_TRANSACTION is highly overall correlated with F_AMOUNT_TRANSACTION and 1 other fields | High correlation |
N_TRANSMISSION_DATE_AND_TIME is highly overall correlated with ID_TRX | High correlation |
N_POINT_OF_SERV_COND_CODE is highly imbalanced (93.1%) | Imbalance |
F_AMOUNT_TRANSACTION is highly skewed (γ1 = 136.3851016) | Skewed |
F_DOLLAR_AMOUNT is highly skewed (γ1 = 20.57713058) | Skewed |
ID_TRX has unique values | Unique |
Reproduction
| Analysis started | 2024-02-29 02:54:34.599641 |
|---|---|
| Analysis finished | 2024-02-29 02:54:48.934229 |
| Duration | 14.33 seconds |
| Software version | ydata-profiling vv4.6.4 |
| Download configuration | config.json |
ID_TRX
Real number (ℝ)
HIGH CORRELATION  UNIQUE 
| Distinct | 29240 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.072731 × 109 |
| Minimum | 5.0655055 × 108 |
|---|---|
| Maximum | 1.6627845 × 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 228.6 KiB |
Quantile statistics
| Minimum | 5.0655055 × 108 |
|---|---|
| 5-th percentile | 5.683798 × 108 |
| Q1 | 7.645199 × 108 |
| median | 1.0543377 × 109 |
| Q3 | 1.3782228 × 109 |
| 95-th percentile | 1.6050329 × 109 |
| Maximum | 1.6627845 × 109 |
| Range | 1.1562339 × 109 |
| Interquartile range (IQR) | 6.1370286 × 108 |
Descriptive statistics
| Standard deviation | 3.3767034 × 108 |
|---|---|
| Coefficient of variation (CV) | 0.31477635 |
| Kurtosis | -1.2521703 |
| Mean | 1.072731 × 109 |
| Median Absolute Deviation (MAD) | 3.0670248 × 108 |
| Skewness | 0.069225837 |
| Sum | 3.1366654 × 1013 |
| Variance | 1.1402126 × 1017 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 508924809 | 1 | < 0.1% |
| 1409285154 | 1 | < 0.1% |
| 1403738609 | 1 | < 0.1% |
| 1380962454 | 1 | < 0.1% |
| 1374590576 | 1 | < 0.1% |
| 1372321799 | 1 | < 0.1% |
| 1360071657 | 1 | < 0.1% |
| 1347126092 | 1 | < 0.1% |
| 1345864447 | 1 | < 0.1% |
| 1345467058 | 1 | < 0.1% |
| Other values (29230) | 29230 |
| Value | Count | Frequency (%) |
| 506550552 | 1 | |
| 506550741 | 1 | |
| 506562485 | 1 | |
| 506566788 | 1 | |
| 506592649 | 1 | |
| 506696925 | 1 | |
| 506722556 | 1 | |
| 506766633 | 1 | |
| 506772052 | 1 | |
| 506789927 | 1 |
| Value | Count | Frequency (%) |
| 1662784452 | 1 | |
| 1662774351 | 1 | |
| 1662774095 | 1 | |
| 1662751677 | 1 | |
| 1662704696 | 1 | |
| 1662638437 | 1 | |
| 1662610375 | 1 | |
| 1662608761 | 1 | |
| 1662604008 | 1 | |
| 1662603967 | 1 |
S_PAN
Text
| Distinct | 14387 |
|---|---|
| Distinct (%) | 49.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 228.6 KiB |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 16 |
| Min length | 16 |
Characters and Unicode
| Total characters | 467840 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 11017 ? |
|---|---|
| Unique (%) | 37.7% |
Sample
| 1st row | 488245******4567 |
|---|---|
| 2nd row | 410443******1014 |
| 3rd row | 434527******5015 |
| 4th row | 410443******0018 |
| 5th row | 478787******2328 |
| Value | Count | Frequency (%) |
| 423087******9005 | 82 | 0.3% |
| 423087******5002 | 81 | 0.3% |
| 423087******0001 | 79 | 0.3% |
| 423087******7009 | 78 | 0.3% |
| 423087******7004 | 76 | 0.3% |
| 423087******4004 | 76 | 0.3% |
| 423087******2005 | 76 | 0.3% |
| 423087******6004 | 75 | 0.3% |
| 423087******3002 | 75 | 0.3% |
| 423087******2001 | 74 | 0.3% |
| Other values (14377) | 28468 |
Most occurring characters
| Value | Count | Frequency (%) |
| * | 175440 | |
| 0 | 54129 | 11.6% |
| 4 | 50699 | 10.8% |
| 7 | 35381 | 7.6% |
| 8 | 29193 | 6.2% |
| 3 | 25196 | 5.4% |
| 2 | 24922 | 5.3% |
| 5 | 20764 | 4.4% |
| 1 | 20138 | 4.3% |
| 9 | 19040 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 292400 | |
| Other Punctuation | 175440 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 54129 | |
| 4 | 50699 | |
| 7 | 35381 | |
| 8 | 29193 | |
| 3 | 25196 | |
| 2 | 24922 | |
| 5 | 20764 | 7.1% |
| 1 | 20138 | 6.9% |
| 9 | 19040 | 6.5% |
| 6 | 12938 | 4.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| * | 175440 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 467840 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| * | 175440 | |
| 0 | 54129 | 11.6% |
| 4 | 50699 | 10.8% |
| 7 | 35381 | 7.6% |
| 8 | 29193 | 6.2% |
| 3 | 25196 | 5.4% |
| 2 | 24922 | 5.3% |
| 5 | 20764 | 4.4% |
| 1 | 20138 | 4.3% |
| 9 | 19040 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 467840 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| * | 175440 | |
| 0 | 54129 | 11.6% |
| 4 | 50699 | 10.8% |
| 7 | 35381 | 7.6% |
| 8 | 29193 | 6.2% |
| 3 | 25196 | 5.4% |
| 2 | 24922 | 5.3% |
| 5 | 20764 | 4.4% |
| 1 | 20138 | 4.3% |
| 9 | 19040 | 4.1% |
S_ENCRYPTED_PAN
Real number (ℝ)
| Distinct | 22309 |
|---|---|
| Distinct (%) | 76.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.0001039 × 1015 |
| Minimum | 1 × 1015 |
|---|---|
| Maximum | 1.0002567 × 1015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 228.6 KiB |
Quantile statistics
| Minimum | 1 × 1015 |
|---|---|
| 5-th percentile | 1.0000008 × 1015 |
| Q1 | 1.0000077 × 1015 |
| median | 1.0000666 × 1015 |
| Q3 | 1.0002022 × 1015 |
| 95-th percentile | 1.0002413 × 1015 |
| Maximum | 1.0002567 × 1015 |
| Range | 2.5674814 × 1011 |
| Interquartile range (IQR) | 1.9447214 × 1011 |
Descriptive statistics
| Standard deviation | 9.1528062 × 1010 |
|---|---|
| Coefficient of variation (CV) | 9.1518553 × 10-5 |
| Kurtosis | -1.6155555 |
| Mean | 1.0001039 × 1015 |
| Median Absolute Deviation (MAD) | 6.3440494 × 1010 |
| Skewness | 0.24884125 |
| Sum | -7.6504498 × 1018 |
| Variance | 8.3773862 × 1021 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.000007246 × 1015 | 32 | 0.1% |
| 1.000003311 × 1015 | 32 | 0.1% |
| 1.000209985 × 1015 | 31 | 0.1% |
| 1.000002614 × 1015 | 31 | 0.1% |
| 1.000004212 × 1015 | 30 | 0.1% |
| 1.000000295 × 1015 | 29 | 0.1% |
| 1.000119843 × 1015 | 28 | 0.1% |
| 1.000002075 × 1015 | 28 | 0.1% |
| 1.000017191 × 1015 | 28 | 0.1% |
| 1.000002493 × 1015 | 27 | 0.1% |
| Other values (22299) | 28944 |
| Value | Count | Frequency (%) |
| 1.000000001 × 1015 | 1 | |
| 1.000000001 × 1015 | 2 | |
| 1.000000002 × 1015 | 1 | |
| 1.000000004 × 1015 | 1 | |
| 1.000000004 × 1015 | 1 | |
| 1.000000004 × 1015 | 1 | |
| 1.000000005 × 1015 | 1 | |
| 1.000000005 × 1015 | 1 | |
| 1.000000005 × 1015 | 1 | |
| 1.000000008 × 1015 | 1 |
| Value | Count | Frequency (%) |
| 1.000256749 × 1015 | 1 | |
| 1.000256529 × 1015 | 1 | |
| 1.000256476 × 1015 | 1 | |
| 1.00025647 × 1015 | 1 | |
| 1.000256452 × 1015 | 1 | |
| 1.000256441 × 1015 | 1 | |
| 1.000256379 × 1015 | 1 | |
| 1.00025627 × 1015 | 1 | |
| 1.000256253 × 1015 | 1 | |
| 1.000256151 × 1015 | 1 |
F_AMOUNT_TRANSACTION
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 9662 |
|---|---|
| Distinct (%) | 33.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17227.87 |
| Minimum | -70000 |
|---|---|
| Maximum | 2.0779638 × 108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 80 |
| Negative (%) | 0.3% |
| Memory size | 228.6 KiB |
Quantile statistics
| Minimum | -70000 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 10.99 |
| median | 126.34 |
| Q3 | 2300 |
| 95-th percentile | 16211.493 |
| Maximum | 2.0779638 × 108 |
| Range | 2.0786638 × 108 |
| Interquartile range (IQR) | 2289.01 |
Descriptive statistics
| Standard deviation | 1364286.5 |
|---|---|
| Coefficient of variation (CV) | 79.190667 |
| Kurtosis | 19596.176 |
| Mean | 17227.87 |
| Median Absolute Deviation (MAD) | 125.34 |
| Skewness | 136.3851 |
| Sum | 5.0374293 × 108 |
| Variance | 1.8612778 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1162 | 4.0% |
| 5.99 | 811 | 2.8% |
| 100 | 565 | 1.9% |
| 9.99 | 493 | 1.7% |
| 630 | 463 | 1.6% |
| 4.99 | 432 | 1.5% |
| 3000 | 414 | 1.4% |
| 0.99 | 389 | 1.3% |
| 50 | 362 | 1.2% |
| 25 | 322 | 1.1% |
| Other values (9652) | 23827 |
| Value | Count | Frequency (%) |
| -70000 | 1 | |
| -67700 | 1 | |
| -63500 | 1 | |
| -50000 | 1 | |
| -25000 | 1 | |
| -20000 | 2 | |
| -17350 | 1 | |
| -15151 | 1 | |
| -15000 | 1 | |
| -14000 | 1 |
| Value | Count | Frequency (%) |
| 207796380 | 1 | |
| 104904965.7 | 1 | |
| 7772996 | 1 | |
| 7000000 | 1 | |
| 6621043 | 1 | |
| 3981922 | 1 | |
| 3691900 | 1 | |
| 3608218 | 1 | |
| 2188794 | 1 | |
| 1927181 | 1 |
N_CURRENCY_CODE_TRANSACTION
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 57 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 501.958 |
| Minimum | 32 |
|---|---|
| Maximum | 986 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 228.6 KiB |
Quantile statistics
| Minimum | 32 |
|---|---|
| 5-th percentile | 188 |
| Q1 | 188 |
| median | 340 |
| Q3 | 840 |
| 95-th percentile | 840 |
| Maximum | 986 |
| Range | 954 |
| Interquartile range (IQR) | 652 |
Descriptive statistics
| Standard deviation | 306.69372 |
|---|---|
| Coefficient of variation (CV) | 0.61099477 |
| Kurtosis | -1.8631959 |
| Mean | 501.958 |
| Median Absolute Deviation (MAD) | 152 |
| Skewness | 0.15081507 |
| Sum | 14677252 |
| Variance | 94061.036 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 840 | 12328 | |
| 188 | 10916 | |
| 340 | 4553 | 15.6% |
| 978 | 213 | 0.7% |
| 484 | 186 | 0.6% |
| 124 | 179 | 0.6% |
| 214 | 125 | 0.4% |
| 170 | 101 | 0.3% |
| 826 | 76 | 0.3% |
| 784 | 65 | 0.2% |
| Other values (47) | 498 | 1.7% |
| Value | Count | Frequency (%) |
| 32 | 11 | < 0.1% |
| 36 | 19 | 0.1% |
| 50 | 1 | < 0.1% |
| 60 | 1 | < 0.1% |
| 68 | 4 | < 0.1% |
| 84 | 49 | 0.2% |
| 124 | 179 | |
| 136 | 4 | < 0.1% |
| 152 | 9 | < 0.1% |
| 156 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 986 | 23 | 0.1% |
| 985 | 4 | < 0.1% |
| 981 | 1 | < 0.1% |
| 978 | 213 | 0.7% |
| 949 | 8 | < 0.1% |
| 946 | 3 | < 0.1% |
| 928 | 2 | < 0.1% |
| 858 | 3 | < 0.1% |
| 840 | 12328 | |
| 834 | 1 | < 0.1% |
F_DOLLAR_AMOUNT
Real number (ℝ)
SKEWED 
| Distinct | 12786 |
|---|---|
| Distinct (%) | 43.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 48.387343 |
| Minimum | 0.007 |
|---|---|
| Maximum | 11678 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 228.6 KiB |
Quantile statistics
| Minimum | 0.007 |
|---|---|
| 5-th percentile | 0.998 |
| Q1 | 2.84 |
| median | 7.0535 |
| Q3 | 20.96 |
| 95-th percentile | 159.1702 |
| Maximum | 11678 |
| Range | 11677.993 |
| Interquartile range (IQR) | 18.12 |
Descriptive statistics
| Standard deviation | 272.21061 |
|---|---|
| Coefficient of variation (CV) | 5.6256574 |
| Kurtosis | 591.48166 |
| Mean | 48.387343 |
| Median Absolute Deviation (MAD) | 5.916 |
| Skewness | 20.577131 |
| Sum | 1414845.9 |
| Variance | 74098.618 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 457 | 1.6% |
| 5.99 | 257 | 0.9% |
| 9.99 | 243 | 0.8% |
| 1.008 | 201 | 0.7% |
| 4.99 | 170 | 0.6% |
| 0.998 | 152 | 0.5% |
| 1.009 | 142 | 0.5% |
| 0.99 | 137 | 0.5% |
| 0.042 | 121 | 0.4% |
| 19.99 | 116 | 0.4% |
| Other values (12776) | 27244 |
| Value | Count | Frequency (%) |
| 0.007 | 1 | < 0.1% |
| 0.01 | 24 | 0.1% |
| 0.014 | 1 | < 0.1% |
| 0.016 | 3 | < 0.1% |
| 0.017 | 1 | < 0.1% |
| 0.018 | 1 | < 0.1% |
| 0.02 | 3 | < 0.1% |
| 0.03 | 4 | < 0.1% |
| 0.04 | 6 | < 0.1% |
| 0.041 | 76 |
| Value | Count | Frequency (%) |
| 11678 | 1 | |
| 10650.01 | 1 | |
| 10000 | 2 | |
| 9419.86 | 1 | |
| 8595.41 | 1 | |
| 7845.66 | 1 | |
| 7663.89 | 1 | |
| 7020.46 | 1 | |
| 6850.48 | 1 | |
| 6766.09 | 1 |
N_TRANSMISSION_DATE_AND_TIME
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 29224 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.0208623 × 1013 |
| Minimum | 2.0201001 × 1013 |
|---|---|
| Maximum | 2.021113 × 1013 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 228.6 KiB |
Quantile statistics
| Minimum | 2.0201001 × 1013 |
|---|---|
| 5-th percentile | 2.0201026 × 1013 |
| Q1 | 2.0210117 × 1013 |
| median | 2.0210506 × 1013 |
| Q3 | 2.0210818 × 1013 |
| 95-th percentile | 2.0211109 × 1013 |
| Maximum | 2.021113 × 1013 |
| Range | 1.0129209 × 1010 |
| Interquartile range (IQR) | 7.0102528 × 108 |
Descriptive statistics
| Standard deviation | 3.8853951 × 109 |
|---|---|
| Coefficient of variation (CV) | 0.00019226421 |
| Kurtosis | -0.0081577972 |
| Mean | 2.0208623 × 1013 |
| Median Absolute Deviation (MAD) | 3.2405363 × 108 |
| Skewness | -1.4005462 |
| Sum | 5.9090015 × 1017 |
| Variance | 1.5096295 × 1019 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.021011608 × 1013 | 2 | < 0.1% |
| 2.021092423 × 1013 | 2 | < 0.1% |
| 2.020120214 × 1013 | 2 | < 0.1% |
| 2.021042615 × 1013 | 2 | < 0.1% |
| 2.021052924 × 1013 | 2 | < 0.1% |
| 2.021061011 × 1013 | 2 | < 0.1% |
| 2.021081523 × 1013 | 2 | < 0.1% |
| 2.02105311 × 1013 | 2 | < 0.1% |
| 2.021011414 × 1013 | 2 | < 0.1% |
| 2.021102612 × 1013 | 2 | < 0.1% |
| Other values (29214) | 29220 |
| Value | Count | Frequency (%) |
| 2.020100101 × 1013 | 1 | |
| 2.020100101 × 1013 | 1 | |
| 2.020100102 × 1013 | 1 | |
| 2.020100102 × 1013 | 1 | |
| 2.020100102 × 1013 | 1 | |
| 2.020100103 × 1013 | 1 | |
| 2.020100104 × 1013 | 1 | |
| 2.020100105 × 1013 | 1 | |
| 2.020100105 × 1013 | 1 | |
| 2.020100105 × 1013 | 1 |
| Value | Count | Frequency (%) |
| 2.021113022 × 1013 | 1 | |
| 2.021113021 × 1013 | 1 | |
| 2.021113021 × 1013 | 1 | |
| 2.021113021 × 1013 | 1 | |
| 2.02111302 × 1013 | 1 | |
| 2.021113019 × 1013 | 1 | |
| 2.021113019 × 1013 | 1 | |
| 2.021113019 × 1013 | 1 | |
| 2.021113019 × 1013 | 1 | |
| 2.021113019 × 1013 | 1 |
N_MERCHANT_TYPE
Real number (ℝ)
| Distinct | 221 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5656.3696 |
| Minimum | 1771 |
|---|---|
| Maximum | 9406 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 228.6 KiB |
Quantile statistics
| Minimum | 1771 |
|---|---|
| 5-th percentile | 4121 |
| Q1 | 4814 |
| median | 5814 |
| Q3 | 5818 |
| 95-th percentile | 7832 |
| Maximum | 9406 |
| Range | 7635 |
| Interquartile range (IQR) | 1004 |
Descriptive statistics
| Standard deviation | 1137.4609 |
|---|---|
| Coefficient of variation (CV) | 0.20109382 |
| Kurtosis | 1.3072733 |
| Mean | 5656.3696 |
| Median Absolute Deviation (MAD) | 193 |
| Skewness | 0.82384581 |
| Sum | 1.6539225 × 108 |
| Variance | 1293817.4 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4121 | 4769 | |
| 5816 | 3169 | 10.8% |
| 5812 | 1745 | 6.0% |
| 5818 | 1553 | 5.3% |
| 5815 | 1327 | 4.5% |
| 5734 | 1231 | 4.2% |
| 4812 | 1228 | 4.2% |
| 5817 | 1125 | 3.8% |
| 5942 | 905 | 3.1% |
| 5814 | 856 | 2.9% |
| Other values (211) | 11332 |
| Value | Count | Frequency (%) |
| 1771 | 4 | < 0.1% |
| 2741 | 5 | < 0.1% |
| 2842 | 1 | < 0.1% |
| 3000 | 15 | |
| 3001 | 4 | < 0.1% |
| 3007 | 1 | < 0.1% |
| 3014 | 1 | < 0.1% |
| 3039 | 6 | < 0.1% |
| 3052 | 2 | < 0.1% |
| 3058 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 9406 | 179 | |
| 9399 | 236 | |
| 9311 | 88 | 0.3% |
| 9222 | 1 | < 0.1% |
| 8999 | 157 | |
| 8931 | 2 | < 0.1% |
| 8911 | 5 | < 0.1% |
| 8734 | 1 | < 0.1% |
| 8699 | 31 | 0.1% |
| 8675 | 1 | < 0.1% |
N_ACQ_INSTITUTION_COUNTRY_CODE
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 70 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 611.35595 |
| Minimum | 32 |
|---|---|
| Maximum | 862 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 228.6 KiB |
Quantile statistics
| Minimum | 32 |
|---|---|
| 5-th percentile | 188 |
| Q1 | 372 |
| median | 702 |
| Q3 | 840 |
| 95-th percentile | 840 |
| Maximum | 862 |
| Range | 830 |
| Interquartile range (IQR) | 468 |
Descriptive statistics
| Standard deviation | 232.63892 |
|---|---|
| Coefficient of variation (CV) | 0.38052942 |
| Kurtosis | -1.1807545 |
| Mean | 611.35595 |
| Median Absolute Deviation (MAD) | 138 |
| Skewness | -0.46938145 |
| Sum | 17876048 |
| Variance | 54120.868 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 840 | 8314 | |
| 528 | 4837 | |
| 826 | 4659 | |
| 372 | 2779 | 9.5% |
| 340 | 2541 | 8.7% |
| 188 | 1838 | 6.3% |
| 702 | 1602 | 5.5% |
| 591 | 925 | 3.2% |
| 344 | 194 | 0.7% |
| 124 | 181 | 0.6% |
| Other values (60) | 1370 | 4.7% |
| Value | Count | Frequency (%) |
| 32 | 3 | < 0.1% |
| 36 | 29 | 0.1% |
| 40 | 1 | < 0.1% |
| 56 | 110 | |
| 60 | 1 | < 0.1% |
| 68 | 4 | < 0.1% |
| 76 | 20 | 0.1% |
| 84 | 1 | < 0.1% |
| 100 | 3 | < 0.1% |
| 124 | 181 |
| Value | Count | Frequency (%) |
| 862 | 2 | < 0.1% |
| 858 | 1 | < 0.1% |
| 840 | 8314 | |
| 826 | 4659 | |
| 818 | 24 | 0.1% |
| 804 | 1 | < 0.1% |
| 792 | 6 | < 0.1% |
| 784 | 60 | 0.2% |
| 764 | 1 | < 0.1% |
| 756 | 5 | < 0.1% |
N_ENTRY_MODE
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 228.6 KiB |
| 10 | |
|---|---|
| 100 | |
| 102 | |
| 12 | |
| 11 | 16 |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.4059508 |
| Min length | 2 |
Characters and Unicode
| Total characters | 70350 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 10 |
|---|---|
| 2nd row | 102 |
| 3rd row | 10 |
| 4th row | 10 |
| 5th row | 10 |
Common Values
| Value | Count | Frequency (%) |
| 10 | 14666 | |
| 100 | 8166 | |
| 102 | 3704 | 12.7% |
| 12 | 2688 | 9.2% |
| 11 | 16 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 10 | 14666 | |
| 100 | 8166 | |
| 102 | 3704 | 12.7% |
| 12 | 2688 | 9.2% |
| 11 | 16 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 34702 | |
| 1 | 29256 | |
| 2 | 6392 | 9.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 70350 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 34702 | |
| 1 | 29256 | |
| 2 | 6392 | 9.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 70350 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 34702 | |
| 1 | 29256 | |
| 2 | 6392 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 70350 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 34702 | |
| 1 | 29256 | |
| 2 | 6392 | 9.1% |
N_POINT_OF_SERV_COND_CODE
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 228.6 KiB |
| 59 | |
|---|---|
| 1 | 241 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.9917579 |
| Min length | 1 |
Characters and Unicode
| Total characters | 58239 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 59 |
|---|---|
| 2nd row | 59 |
| 3rd row | 59 |
| 4th row | 59 |
| 5th row | 59 |
Common Values
| Value | Count | Frequency (%) |
| 59 | 28999 | |
| 1 | 241 | 0.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 59 | 28999 | |
| 1 | 241 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 28999 | |
| 9 | 28999 | |
| 1 | 241 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 58239 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 28999 | |
| 9 | 28999 | |
| 1 | 241 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 58239 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 28999 | |
| 9 | 28999 | |
| 1 | 241 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 58239 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 28999 | |
| 9 | 28999 | |
| 1 | 241 | 0.4% |
| Distinct | 4852 |
|---|---|
| Distinct (%) | 16.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 228.6 KiB |
Length
| Max length | 22 |
|---|---|
| Median length | 19 |
| Mean length | 16.078967 |
| Min length | 2 |
Characters and Unicode
| Total characters | 470149 |
|---|---|
| Distinct characters | 79 |
| Distinct categories | 11 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3740 ? |
|---|---|
| Unique (%) | 12.8% |
Sample
| 1st row | GOOGLE *GSUITE_inversi |
|---|---|
| 2nd row | UBR* PENDING.UBER.COM |
| 3rd row | Google LLC GSUITE_iyta |
| 4th row | RAPPI |
| 5th row | FLEXI SHOES |
| Value | Count | Frequency (%) |
| 4504 | 6.8% | |
| paypal | 2433 | 3.7% |
| garena | 2254 | 3.4% |
| costa | 1848 | 2.8% |
| rica | 1826 | 2.8% |
| uber | 1821 | 2.7% |
| apple.com/bill | 1556 | 2.3% |
| ubr | 1534 | 2.3% |
| pending.uber.com | 1533 | 2.3% |
| boacompra | 1453 | 2.2% |
| Other values (5604) | 45504 |
Most occurring characters
| Value | Count | Frequency (%) |
| 39298 | 8.4% | |
| A | 34898 | 7.4% |
| E | 32115 | 6.8% |
| O | 31367 | 6.7% |
| R | 25030 | 5.3% |
| P | 22822 | 4.9% |
| L | 21166 | 4.5% |
| C | 19752 | 4.2% |
| G | 17798 | 3.8% |
| I | 16645 | 3.5% |
| Other values (69) | 209258 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 331067 | |
| Lowercase Letter | 66614 | 14.2% |
| Space Separator | 39298 | 8.4% |
| Other Punctuation | 21606 | 4.6% |
| Decimal Number | 10787 | 2.3% |
| Dash Punctuation | 649 | 0.1% |
| Connector Punctuation | 93 | < 0.1% |
| Open Punctuation | 15 | < 0.1% |
| Math Symbol | 9 | < 0.1% |
| Close Punctuation | 9 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 34898 | 10.5% |
| E | 32115 | 9.7% |
| O | 31367 | 9.5% |
| R | 25030 | 7.6% |
| P | 22822 | 6.9% |
| L | 21166 | 6.4% |
| C | 19752 | 6.0% |
| G | 17798 | 5.4% |
| I | 16645 | 5.0% |
| M | 13904 | 4.2% |
| Other values (16) | 95570 |
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 10548 | |
| o | 5761 | 8.6% |
| e | 5140 | 7.7% |
| a | 4761 | 7.1% |
| t | 4051 | 6.1% |
| n | 3914 | 5.9% |
| d | 3119 | 4.7% |
| p | 3114 | 4.7% |
| c | 2523 | 3.8% |
| g | 2506 | 3.8% |
| Other values (16) | 21177 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 10073 | |
| * | 9579 | |
| / | 1826 | 8.5% |
| & | 66 | 0.3% |
| , | 38 | 0.2% |
| @ | 11 | 0.1% |
| # | 9 | < 0.1% |
| : | 2 | < 0.1% |
| ? | 1 | < 0.1% |
| ! | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1741 | |
| 1 | 1522 | |
| 3 | 1151 | |
| 0 | 1064 | |
| 4 | 959 | |
| 7 | 955 | |
| 5 | 924 | |
| 6 | 875 | |
| 8 | 817 | |
| 9 | 779 |
Space Separator
| Value | Count | Frequency (%) |
| 39298 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 649 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 93 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 15 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 9 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 9 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 397681 | |
| Common | 72468 | 15.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 34898 | 8.8% |
| E | 32115 | 8.1% |
| O | 31367 | 7.9% |
| R | 25030 | 6.3% |
| P | 22822 | 5.7% |
| L | 21166 | 5.3% |
| C | 19752 | 5.0% |
| G | 17798 | 4.5% |
| I | 16645 | 4.2% |
| M | 13904 | 3.5% |
| Other values (42) | 162184 |
Common
| Value | Count | Frequency (%) |
| 39298 | ||
| . | 10073 | 13.9% |
| * | 9579 | 13.2% |
| / | 1826 | 2.5% |
| 2 | 1741 | 2.4% |
| 1 | 1522 | 2.1% |
| 3 | 1151 | 1.6% |
| 0 | 1064 | 1.5% |
| 4 | 959 | 1.3% |
| 7 | 955 | 1.3% |
| Other values (17) | 4300 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 470149 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 39298 | 8.4% | |
| A | 34898 | 7.4% |
| E | 32115 | 6.8% |
| O | 31367 | 6.7% |
| R | 25030 | 5.3% |
| P | 22822 | 4.9% |
| L | 21166 | 4.5% |
| C | 19752 | 4.2% |
| G | 17798 | 3.8% |
| I | 16645 | 3.5% |
| Other values (69) | 209258 |
FRAUDE
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 228.6 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 29240 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 23392 | |
| 1 | 5848 | 20.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 23392 | |
| 1 | 5848 | 20.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 23392 | |
| 1 | 5848 | 20.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 29240 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 23392 | |
| 1 | 5848 | 20.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 29240 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 23392 | |
| 1 | 5848 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29240 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 23392 | |
| 1 | 5848 | 20.0% |
| FRAUDE | F_AMOUNT_TRANSACTION | F_DOLLAR_AMOUNT | ID_TRX | N_ACQ_INSTITUTION_COUNTRY_CODE | N_CURRENCY_CODE_TRANSACTION | N_ENTRY_MODE | N_MERCHANT_TYPE | N_POINT_OF_SERV_COND_CODE | N_TRANSMISSION_DATE_AND_TIME | S_ENCRYPTED_PAN | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| FRAUDE | 1.000 | 0.148 | 0.236 | -0.005 | 0.097 | -0.015 | 0.138 | 0.073 | 0.000 | -0.005 | -0.129 |
| F_AMOUNT_TRANSACTION | 0.148 | 1.000 | 0.336 | -0.024 | -0.471 | -0.807 | 0.000 | -0.168 | 0.000 | -0.024 | -0.167 |
| F_DOLLAR_AMOUNT | 0.236 | 0.336 | 1.000 | -0.015 | 0.002 | 0.172 | 0.041 | 0.163 | 0.029 | -0.015 | -0.146 |
| ID_TRX | -0.005 | -0.024 | -0.015 | 1.000 | -0.061 | 0.016 | 0.066 | 0.014 | 0.024 | 1.000 | 0.268 |
| N_ACQ_INSTITUTION_COUNTRY_CODE | 0.097 | -0.471 | 0.002 | -0.061 | 1.000 | 0.530 | 0.385 | 0.256 | 0.128 | -0.061 | 0.009 |
| N_CURRENCY_CODE_TRANSACTION | -0.015 | -0.807 | 0.172 | 0.016 | 0.530 | 1.000 | 0.211 | 0.245 | 0.037 | 0.016 | 0.068 |
| N_ENTRY_MODE | 0.138 | 0.000 | 0.041 | 0.066 | 0.385 | 0.211 | 1.000 | -0.233 | 0.125 | -0.014 | 0.052 |
| N_MERCHANT_TYPE | 0.073 | -0.168 | 0.163 | 0.014 | 0.256 | 0.245 | -0.233 | 1.000 | 0.226 | 0.014 | 0.026 |
| N_POINT_OF_SERV_COND_CODE | 0.000 | 0.000 | 0.029 | 0.024 | 0.128 | 0.037 | 0.125 | 0.226 | 1.000 | -0.024 | 0.036 |
| N_TRANSMISSION_DATE_AND_TIME | -0.005 | -0.024 | -0.015 | 1.000 | -0.061 | 0.016 | -0.014 | 0.014 | -0.024 | 1.000 | 0.268 |
| S_ENCRYPTED_PAN | -0.129 | -0.167 | -0.146 | 0.268 | 0.009 | 0.068 | 0.052 | 0.026 | 0.036 | 0.268 | 1.000 |
| ID_TRX | S_PAN | S_ENCRYPTED_PAN | F_AMOUNT_TRANSACTION | N_CURRENCY_CODE_TRANSACTION | F_DOLLAR_AMOUNT | N_TRANSMISSION_DATE_AND_TIME | N_MERCHANT_TYPE | N_ACQ_INSTITUTION_COUNTRY_CODE | N_ENTRY_MODE | N_POINT_OF_SERV_COND_CODE | S_MERCHANT_LEGAL_NAME | FRAUDE | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 508924809 | 488245******4567 | 1000065275814567 | 5.40 | 840 | 5.388 | 20201001190732 | 7372 | 840 | 10 | 59 | GOOGLE *GSUITE_inversi | 0 |
| 1 | 509028097 | 410443******1014 | 1000126292771014 | 3350.00 | 188 | 5.570 | 20201001194719 | 4121 | 372 | 102 | 59 | UBR* PENDING.UBER.COM | 0 |
| 2 | 511351088 | 434527******5015 | 1000007433745015 | 10.80 | 840 | 10.800 | 20201002095534 | 4816 | 840 | 10 | 59 | Google LLC GSUITE_iyta | 0 |
| 3 | 515833505 | 410443******0018 | 1000126530570018 | 4960.80 | 188 | 8.280 | 20201004101419 | 4814 | 188 | 10 | 59 | RAPPI | 0 |
| 4 | 519145453 | 478787******2328 | 1000020382542328 | 40250.00 | 188 | 67.177 | 20201005173241 | 5661 | 188 | 10 | 59 | FLEXI SHOES | 0 |
| 5 | 523991261 | 434527******8007 | 1000007856338007 | 5.99 | 840 | 5.990 | 20201007192629 | 5815 | 528 | 100 | 59 | Spotify P11A48F847 | 0 |
| 6 | 530513150 | 476528******7304 | 1000000477837304 | 29.24 | 840 | 29.240 | 20201010133314 | 5942 | 840 | 10 | 59 | Amazon Payments | 0 |
| 7 | 542249902 | 424905******2900 | 1000056602832900 | 9040.00 | 188 | 15.083 | 20201015182823 | 5812 | 372 | 102 | 59 | UBR* PENDING.UBER.COM | 0 |
| 8 | 542299586 | 424905******9439 | 1000044754719439 | 1405.00 | 188 | 2.344 | 20201015185509 | 4121 | 702 | 100 | 59 | DidiChuxing | 0 |
| 9 | 546057827 | 423087******6002 | 1000198882226002 | 1.09 | 840 | 1.099 | 20201017095506 | 5816 | 840 | 12 | 59 | GOOGLE*GARENA | 0 |
| ID_TRX | S_PAN | S_ENCRYPTED_PAN | F_AMOUNT_TRANSACTION | N_CURRENCY_CODE_TRANSACTION | F_DOLLAR_AMOUNT | N_TRANSMISSION_DATE_AND_TIME | N_MERCHANT_TYPE | N_ACQ_INSTITUTION_COUNTRY_CODE | N_ENTRY_MODE | N_POINT_OF_SERV_COND_CODE | S_MERCHANT_LEGAL_NAME | FRAUDE | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 29230 | 1618312961 | 411757******3034 | 1000051416573034 | 49.99 | 840 | 49.990 | 20211114042804 | 5815 | 840 | 100 | 59 | APPLE.COM/BILL | 1 |
| 29231 | 1623112246 | 494171******4087 | 1000249852384087 | 31.40 | 840 | 31.400 | 20211115194606 | 4215 | 591 | 10 | 59 | PEDIDOS YA | 1 |
| 29232 | 1623966109 | 477370******7006 | 1000043180407006 | 950.00 | 188 | 1.490 | 20211116084041 | 4121 | 372 | 102 | 59 | UBR* PENDING.UBER.COM | 0 |
| 29233 | 1631233148 | 423087******0006 | 1000195942220006 | 39.99 | 840 | 40.325 | 20211118180509 | 5818 | 840 | 100 | 59 | APPLE.COM/BILL | 0 |
| 29234 | 1634406940 | 415237******6308 | 1000252689406308 | 8616.45 | 340 | 357.449 | 20211119184835 | 5912 | 340 | 12 | 59 | FARMACIAS KIELSA FICOL | 1 |
| 29235 | 1640911789 | 423087******1008 | 1000198432161008 | 10.42 | 840 | 10.506 | 20211122111915 | 8999 | 591 | 10 | 59 | SLS TRIP | 0 |
| 29236 | 1641330999 | 423087******7004 | 1000198982267004 | 2.99 | 840 | 3.014 | 20211122133111 | 5816 | 826 | 102 | 59 | PAYPAL *BOACOMPRA | 0 |
| 29237 | 1654136929 | 423087******0006 | 1000117158860006 | 126.35 | 340 | 5.239 | 20211127131110 | 5816 | 826 | 102 | 59 | PAYPAL *BOACOMPRA | 0 |
| 29238 | 1655793681 | 478789******4134 | 1000199634524134 | 640.00 | 188 | 1.018 | 20211128080109 | 5816 | 528 | 10 | 59 | GOOGLE lucydream game | 0 |
| 29239 | 1662610375 | 472055******0785 | 1000248589180785 | 1.99 | 840 | 2.001 | 20211130192739 | 5735 | 840 | 100 | 59 | APPLE.COM/BILL | 0 |